**********************************
***** REPLICAION FILE README *****
**********************************

State Employment as a Strategy of Autocratic Control in China
Jaya Wen - January 2025

All figures and tables in the paper are created by "tables and figures.do". To execute the replication file, change the file path in line 27 of "tables and figures.do" to the local path where this replication package is saved. 

All relevant data are provided in the "replication package/data" folder in Stata format (.dta). Some raw data files are provided in Excel format (.xls).

Results were obtained using Stata/MP 18.0 in January 2025. The operating system was Windows 11 Pro, Version 23H2.


***************************
***** Data Dictionary *****
***************************

***** UHS Variables *****
** Relevant data is in data/uhs/*.dta
** Variable categories reported in data/uhs/UHS variable codes.csv
** data/cty_corr.dta and data/dist_corr.dta are averages of UHS data at the county and district level
dcode - anonymized household identifier
hcode - anonymized household identifier
year - year
prov - province code, China GuoBiao System
relation - relationship to household head
hukou - hukou status
birthyr - birth year
age - age in years
minority - indicator for minority ethnic group
sex - indicator for sex
pension - total pension payments received, RMB 
relief - total relief payments received, RMB
pork - total expenditures on pork, RMB
mutton - total expenditures on mutton, RMB
edugroup - level of education
yeareduc - years of education
sector - sector of employment
ownership - ownership of employment
occupation - occupation code
totinc - total income, RMB
salaryincome - salary, RMB
selfbusinessincome - income from self-employment, RMB
hhincome - household income, RMB
ddcode - anonymized household identifier
code - anonymized household identifier
empl - indicator for employment
empl_soe - indicator for employment in SOE
empl_oth - indicator for employment in non-SOE and non-private
nonsal - non-salary income, RMB
gbcode - GuoBiao county code
dist - GuoBiao district code
unemp - indicator for employment


***** Incident Variables *****
** data/proquest/proquest.dta
xjpq - number of incidents in each year 
lxjpq - lag number of incidents in each year
** data/proquest/xjpqloc.dta
xjpqloc - number of incidents in each year, omitting events triggered outside Xinjiang
lxjpqloc - lag number of incidents in each year, omitting events triggered outside Xinjiang
** data/proquest/xjpqnec.dta
xjpqnec - number of incidents in each year, omitting events with economic triggers
lxjpqnec - lag number of incidents in each year, omitting events with economic triggers
** data/proquest/xjpqbinary.dta
xjpqbinary - indicator for high-incident years
lxjpqbinary - lag indicator for high-incident years


***** Census Variables *****
** data/census/census/
uygcty00 - county-level Uyghur share, 2000 Census
uygcty10 - county-level Uyghur share, 2010 Census
uygcty90 - county-level Uyghur share, 1990 Census
huicty00 - county-level Hui share, 2000 Census
othcty00 - county-level non-Uyghur minority share, 2000 Census


***** CECC Variables *****
** Relevant data is in data/cecc.dta
** Additional information at https://www.cecc.gov/resources/political-prisoner-database
CECCrecordnumber - Unique identifier for the record in the dataset
detentionstatus - Status of the individual’s detention (e.g., detained, released)
issuecategory - Category of the issue or charge related to detention
mainname - Main name of the individual
Chinesecharactersmainname - The individual's main name written in Chinese characters
alternatenamelayorpen - Alternate name, possibly including a layperson's version or nickname
additionalnames - Other names the individual may go by
pinyinname - Name in Pinyin (Romanized Chinese)
ethnicgroup - The individual's ethnic group
sex - Gender of the individual
ageatdetention - Age of the individual at the time of detention
religion - Religious affiliation of the individual
occupation - The individual's occupation or profession
affiliation - Political or organizational affiliation of the individual
residenceprovince - Province where the individual resides
residenceprefecture - Prefecture (or regional area) where the individual resides
residencecounty - County where the individual resides
dateofdetention - Date the individual was detained
currentorlastprisondetenti - Status of the individual's detention in prison (current or last known)
currentorlastsentenceorti - Details about the current or last sentence or order to detainprovincewhereimprisonedordet - Province where the individual was imprisoned or detained
prefecturewhereimprisonedord - Prefecture where the individual was imprisoned or detained
countywhereimprisonedordetai - County where the individual was imprisoned or detained
legalprocess - Description of the leg


***** Province Yearbook Variables *****
** Relevant data is in data/state_yearbook.dta
provname - Name of the province
year - Year
gov_consumption - Government expenditure in the province
total_fixed_investment - Total fixed investment made in the province
state_investment - Investment made by the state
prov - GuoBiao province code
protestcount - Number of protests
pop - Population of the province
gdppc - GDP per capita in the province
soepctprov - Percent of SOE employment
privpctprov - Percent of private employment
lngdppc - Log GDP per capita


***** ASIP Variables *****
** Relevant data is in data/firm_regs.dta
gbcode - GuoBiao county code
year - year
prov - GuoBiao province code
sector_id - CIC4 industry code
soe - firm is SOE
dompriv - firm is domestic private
lnsales - logged total sales
LP - labor productivity
TFPR_HK - TFPR, Hsieh Klenow 2009
TFPR_OLS - TFPR, OLS 
TFPR_DLW - TFPR, De Loecker Warzinski 2012
TFPR_GNR - TFPR, Gandhi Navarro Rivers 2013
TFPR_DLWwage - TFPR, De Loecker Warzinski 2012, wage method
TFPR_OLSwage - TFPR, Gandhi Navarro Rivers 2013, wage method


